OcrV1, Main, Exploration, bibRecord, 000030

Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight

Identifieur interne : 000030 ( Main/Exploration ); précédent : 000029; suivant : 000031

Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight

Auteurs : Michael Cutter ; Roberto Manduchi

Source :

Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering ; 2015.

RBID : PMC:4677830

Abstract

The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4677830

DOI: 10.1145/2682571.2797066
PubMed: 26677461
PubMed Central: 4677830

Affiliations:

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">26677461</idno>
<idno type="pmc">4677830</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4677830</idno>
<idno type="RBID">PMC:4677830</idno>
<idno type="doi">10.1145/2682571.2797066</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000141</idno>
<idno type="wicri:Area/Pmc/Curation">000141</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000011</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="wicri:Area/PubMed/Corpus">000002</idno>
<idno type="wicri:Area/PubMed/Curation">000002</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000002</idno>
<idno type="wicri:Area/Ncbi/Merge">000246</idno>
<idno type="wicri:Area/Ncbi/Curation">000246</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000246</idno>
<idno type="wicri:Area/Main/Merge">000028</idno>
<idno type="wicri:Area/Main/Curation">000030</idno>
<idno type="wicri:Area/Main/Exploration">000030</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</author>
</analytic>
<series><title level="j">Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering</title>
<imprint><date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p id="P1">The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.</p>
</div>
</front>
</TEI>
<affiliations><list></list>
<tree><noCountry><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
<name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000030 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000030 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:4677830
   |texte=   Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:26677461" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

Serveur d'exploration sur l'OCR

Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight

Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight

Source :

Abstract

Links toward previous steps (curation, corpus...)

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri

Pour générer des pages wiki

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.